Learning to Search on Manifolds for 3D Pose Estimation of Articulated Objects

نویسندگان

  • Yu Zhang
  • Chi Xu
  • Li Cheng
چکیده

This paper focuses on the challenging problem of 3D pose estimation of a diverse spectrum of articulated objects from single depth images. A novel structured prediction approach is considered, where 3D poses are represented as skeletal models that naturally operate on manifolds. Given an input depth image, the problem of predicting the most proper articulation of underlying skeletal model is thus formulated as sequentially searching for the optimal skeletal configuration. This is subsequently addressed by convolutional neural nets trained end-to-end to render sequential prediction of the joint locations as regressing a set of tangent vectors of the underlying manifolds. Our approach is examined on various articulated objects including human hand, mouse, and fish benchmark datasets. Empirically it is shown to deliver highly competitive performance with respect to the state-of-the-arts, while operating in real-time (over 30 FPS).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

3D Pose Estimation, Tracking and Model Learning of Articulated Objects from Dense Depth Video using Projected Texture Stereo

Service robots deployed in domestic environments generally need the capability to deal with articulated objects such as doors and drawers in order to fulfill certain mobile manipulation tasks. This however, requires, that the robots are able to perceive articulated furniture objects such as cupboards, dishwashers and cabinets. In this paper, we present an approach for detecting, tracking, and l...

متن کامل

Parametrized SOMs for Object Recognition and Pose Estimation

We present the “Parametrized Self-Organizing Map” (PSOM) as a method for 3D object recognition and pose estimation. The PSOM can be seen as a continuous extension of the standard SelfOrganizing Map which generalizes the discrete set of reference vectors to a continuous manifold. In the context of visual learning, manifolds based on PSOMs can be used to represent the appearance of various object...

متن کامل

Continuous - state Graphical Models for Object Localization , Pose Estimation and Tracking

of “Continuous-state Graphical Models for Object Localization, Pose Estimation and Tracking” by Leonid Sigal, Ph.D., Brown University, May 2008. Reasoning about pose and motion of objects, based on images or video, is an important task for many machine vision applications. Estimating the pose of articulated objects such as people and animals is particularly challenging due to the complexity of ...

متن کامل

A generalized parametric 3D shape representation for articulated pose estimation

We present a novel parametric 3D shape representation, Generalized sum of Gaussians (G-SoG), which is particularly suitable for pose estimation of articulated objects. Compared with the original sum-of-Gaussians (SoG), G-SoG can handle both isotropic and anisotropic Gaussians, leading to a more flexible and adaptable shape representation yet with much fewer anisotropic Gaussians involved. An ar...

متن کامل

Shape Models of the Human Body for Distributed Inference

of “Shape Models of the Human Body for Distributed Inference” by Silvia Zuffi, Ph.D., Brown University, May 2015 In this thesis we address the problem of building shape models of the human body, in 2D and 3D, which are realistic and efficient to use. We focus our efforts on the human body, which is highly articulated and has interesting shape variations, but the approaches we present here can b...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1612.00596  شماره 

صفحات  -

تاریخ انتشار 2016